Bayesian Methods for Mixture Modeling

نویسنده

  • Valeriu Savcenco
چکیده

This Master’s thesis is mostly focused on Bayesian methods for the selection and testing of discrete mixture models. The main problem that is studied in the project is the analysis of data sets of several categorical variables (e.g. test items, symptoms, genes) collected on a set of subjects. We fit a discrete mixture model to the data which means that the dependencies among the different variables are captured by a latent categorical variable. We assume that the manifest variables are independent given the latent variable. We implemented a Matlab program an exploratory data tool that searches for latent classes of interest for given data sets. Another problem studied in the project is the possibility of having missing observed data. For solving this problem we introduce an additional step in Gibbs sampling in which the values for the missing data are sampled. The methods described in the thesis are applied for a psychiatric diagnostics data set and for a data set containing information from schizophrenia affected and healthy persons. Bayesianska metoder för identifiering av sammansatta fördelningar

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A two-component Bayesian mixture model to identify implausible gestational age

Background: Birth weight and gestational age are two important variables in obstetric research. The primary measure of gestational age is based on a mother’s recall of her last menstrual period. This recall may cause random or systematic errors. Therefore, the objective of this study is to utilize Bayesian mixture model in order to identify implausible gestational age.   Methods: ...

متن کامل

Mixture Block Methods for Non Parametric Bayesian Models with Applications

OF THE DISSERTATION Mixture Block Methods for Non Parametric Bayesian Models with Applications By Ian Porteous Doctor of Philosophy in Computer Science University of California, Irvine, 2010 Professor Max Welling, Chair This study brings together Bayesian networks, topic models, hierarchical Bayes modeling and nonparametric Bayesian methods to build a framework for efficiently designing and imp...

متن کامل

Networks of Mixture Blocks for Non Parametric Bayesian Models with Applications

OF THE DISSERTATION Networks of Mixture Blocks for Non Parametric Bayesian Models with Applications By Ian Porteous Doctor of Philosophy in Information and Computer Science University of California, Irvine, 2010 Professor Max Welling, Chair This study brings together Bayesian networks, topic models, hierarchical Bayes modeling and nonparametric Bayesian methods to build a framework for efficien...

متن کامل

‎A Bayesian mixture model‎ for classification of certain and uncertain data

‎There are different types of classification methods for classifying the certain data‎. ‎All the time the value of the variables is not certain and they may belong to the interval that is called uncertain data‎. ‎In recent years‎, ‎by assuming the distribution of the uncertain data is normal‎, ‎there are several estimation for the mean and variance of this distribution‎. ‎In this paper‎, ‎we co...

متن کامل

Factorized Asymptotic Bayesian Inference for Mixture Modeling

This paper proposes a novel Bayesian approximation inference method for mixture modeling. Our key idea is to factorize marginal log-likelihood using a variational distribution over latent variables. An asymptotic approximation, a factorized information criterion (FIC), is obtained by applying the Laplace method to each of the factorized components. In order to evaluate FIC, we propose factorize...

متن کامل

Bayesian and Iterative Maximum Likelihood Estimation of the Coefficients in Logistic Regression Analysis with Linked Data

This paper considers logistic regression analysis with linked data. It is shown that, in logistic regression analysis with linked data, a finite mixture of Bernoulli distributions can be used for modeling the response variables. We proposed an iterative maximum likelihood estimator for the regression coefficients that takes the matching probabilities into account. Next, the Bayesian counterpart...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003